GiantMIDI-Piano: A Large-Scale MIDI Dataset for Classical Piano Music

نویسندگان

چکیده

Symbolic music datasets are important for information retrieval and musical analysis. However, there is a lack of large-scale symbolic classical piano music. In this article, we describe the creation GiantMIDI-Piano (GP) dataset containing 38,700,838 transcribed notes 10,855 unique solo works composed by 2,786 composers. We extract names composers from International Music Score Library Project (IMSLP). search download their corresponding audio recordings Internet. further create curated subset 7,236 1,787 where titles downloaded contain surnames apply convolutional neural network to detect works. Then, transcribe those into Musical Instrument Digital Interface (MIDI) files using high-resolution transcription system. Each MIDI file contains onset, offset, pitch, velocity attributes pedals. includes 90% live performance 10% sequence input files. analyse statistics show pitch class, interval, trichord, tetrachord frequencies six different eras that can be used evaluate quality in terms detection F1 scores, metadata accuracy, error rates. release source code acquiring at https://github.com/bytedance/GiantMIDI-Piano.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Piano Music Companion

We present a system that we call ‘The Piano Music Companion’ and that is able to follow and understand (at least to some extent) a live piano performance. Within a few seconds this system can identify the piece that is being played, and the position within the piece. It then tracks the progress of the performer over time via a robust score following algorithm. The companion is useful in multipl...

متن کامل

Detection of Key Change in Classical Piano Music

Tonality is an important aspect of musical structure. Detecting key of music is one of the major tasks in tonal analysis and will benefit semantic segmentation of music for indexing and searching. This paper presents an HMM-based approach for segmenting musical signals based on key change and identifying the key of each segment. Classical piano music was used in the experiment. The performance,...

متن کامل

Strategies towards the Automatic Annotation of Classical Piano Music

Analysis and description of musical expression is a substantial field within musicology. However, manual annotation of large corpora of music, a prerequisite for describing and comparing different artists’ styles, is very laborintensive. Therefore, computer systems are needed that can annotate recordings of different performances automatically, requiring only minimal corrections by the user. In...

متن کامل

Unsupervised Transcription of Piano Music

We present a new probabilistic model for transcribing piano music from audio to a symbolic form. Our model reflects the process by which discrete musical events give rise to acoustic signals that are then superimposed to produce the observed data. As a result, the inference procedure for our model naturally resolves the source separation problem introduced by the the piano’s polyphony. In order...

متن کامل

Towards Automatic Music Transcription: Extraction of MIDI-Data out of Polyphonic Piano Music

Driven by the increasing amount of music available electronically the need of automatic search and retrieval systems for music becomes more and more important. In this paper an algorithm for automatic transcription of polyphonic piano music into MIDI data is presented, which is a very interesting basis for database applications and music analysis. The first part of the algorithm performs a note...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Transactions of the International Society for Music Information Retrieval

سال: 2022

ISSN: ['2514-3298']

DOI: https://doi.org/10.5334/tismir.80